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Abstract 

In 2013, Courtade and Kumar posed the following problem: Let x ^ {±1}" be uniformly 
random, and form y ^ {±1}"' by negating each bit of x independently with probability a. Is 
it true that the mutual information I{f{x) ; y) is maximized among / : {il}" —>■ {±1} by 
f{x) = xil We do not resolve this problem. Instead, we resolve the analogous problem in the 
settings of Gaussian space and the sphere. Our proof uses rearrangement. 

1 The Courtade—Kumar Conjecture 

In 2013, Courtade and Kumar [KC13, CK14] made the following conjecture: 

The Courtade—Kumar “Most Informative Boolean Function” Conjecture. 

Let X ~ {±1}’^ be uniformly random and form y ~ {±1}’^ by negating each bit of x indepen¬ 
dently with probability a. Then for any f : {±1}"’ ^ {±1} it holds that I{f{x) ; y) < 1 — h{a). 
(This bound is achieved by any f of the form f{x) = ixj.j 

The conjecture attracted fairly widespread attention; it is currently unresolved (though [CK14] 
verifies it for n < 7). Courtade offers a prize of $100 for a proof or disproof [Coul4]. 

Let us briefly discuss the notation used in this problem. First, we henceforth assume a < ^, as 
it’s easy to see the problem is unchanged if a is replaced by 1 —a. The mutual information I(A;B) 
of two discrete random variables is defined to be H{B) — H{B\A). Here H{B) denotes entropy, 
namely H{B) = = b] ) (with log = log 2 ), and H{B\A) denotes conditional 

entropy, namely the expected value of H{B \ A = a) when a is distributed as A. For /3 G [0,1] we 
write h{/3) = ft log(^) + (1 — /?) log(yXg) for the entropy of the two-valued random variable that is 
— 1 with probability (5 and -|-1 with probability 1 — (5. We will also be using traditional notation 
from the field of analysis of Boolean functions [0’D14]. In particular, recall that {x,y) is said to 
be a pair of p-eorrelated random strings, where p = E[a;jyJ = 1 — 2a > 0 (and (y, x) has the same 
distribution). Also recall that for / : {il}” —)• R, the function Tpf : {±1}"' —> R is defined by 
Tp/(a^) = E[/(y) \ x = x\. Note that E[Tp/] = E[/] (where we use the shorthand E[y] = E[y(£c)]). 
Using this notation, and defining for convenience 

: [-1,1] [0,1], ^{t) = 1 - /i(J - F) = 172 ■ ( 2 T ■ ^ + BT ■ -) (^) 
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we have 


I{f{x) ; y) = H{f{x)) - H{f{x)\y) = h{\ + \ E[/]) - + ^T,/(a^))] 

= E[cI>(T,/(a^))] - ^E[T,f{x)]) = Bnt^[T,f], 

where in the last equality we are using the <h-entropy notation from, e.g., [Cha04]. Thus we have 
the following equivalent formulation: 

Courtade—Kumar Conjecture (equivalently). For f : {±1}” —5- {±1} and p G [0,1] it holds 
that Ent'*’[Tp/] < where is as in (1). 

We remark that is very close to the function t t^, and that the analogous statement 

Ent*^‘'[Tp/] = Var[Tp/] < = {I - 2a)\ 

(with equality if and only if f[x) = Txj, presuming 0 < jpj < 1) has a rather trivial Fourier-theoretic 
proof. (Combine [0’D14, Prop. 1.13, Prop 2.47, Ex. 1.19(a)].) 

1.1 Prior work 

The Courtade-Kumar Conjecture is a very natural one in information theory and the analysis 
of Boolean functions. Courtade and Kumar report that their original motivation came from the 
work [KKBS14], which observed that among / : {±1}’^ ^ {=tl} with E[/] = /r > 0, the quantity 
I{f{x) ; xi) is maximized by those / with f{x) > xi. In turn, [KKBS14] was motivated by 
a work [SJ08] on the regulatory network of E. coli. A connection between the conjecture and 
cryptography is discussed in [CVM"''13]. Finally, Courtade and Kumar also offered a motivation 
from gambling (stock markets, horse races), and in fact closely related problems were studied earlier 
by Erkip and Cover [EC98]. In [CK14] the weaker result I{f{x) ; y) < (1 — 2a)^ = p^ is attributed 
to Erkip [Erk96]. 

There are some natural weakenings of the conjecture that are still open. For example, it is 
natural to expect that maximizing / are unbiased, meaning E[/] = 0. However, the conjecture re¬ 
mains open even under this assumption. Courtade and Kumar also left open the weaker conjecture 
“/(/(s) ; g{y)) < 1 — h{a) for f,g : {±1}*^ —{±1}”, but remarked that it is an exercise assuming 
both / and g are unbiased. Bogdanov and Nair [BN13] have apparently proved this weaker con¬ 
jecture under the assumption that f = g (and a > ^); see also [AGKN13], in which the weaker 
conjecture is reduced to an explicit three-dimensional numerical inequality which, empirically, ap¬ 
pears to be true. Courtade and Kumar also proved the weakening Y17=i Hfi^) ! Vi) < 1 “ h{a) 
under the assumption that / is unbiased. 

Certain strengthenings of the Courtade-Kumar Conjecture have also been considered; see, 
e.g., the information theory work [CVM14]. Another interesting example comes from the work of 
Chandar and Tchamkerten [CT14], who considered the more general conjecture 

< 1 _ h{a) for all / : {±1}" ^ {±1}^ (2) 

k 

Chandar and Tchamkerten generalized the Erkip-Cover bound by showing that one can take 
(1 — 2 q:)^ on the right-hand side above, for all k. However they also showed that (2) is too strong; 
in fact, a right-hand side of (1 — 2a)^ can be achieved in the limit when first n —)• oo and then 
k —>■ oo. In particular, by taking / to be the indicator of certain perfect codes, they showed that (2) 
can fail when, e.g., n = 15, /c = 11, a G [0.05,0.5]. 
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In recent work, Ordentlich, Shayevitz, and Weinstein [OSW15] showed that the Courtade- 
Kumar Conjecture holds for unbiased functions when a is very close to 0 or In particular, they 
proved that the conjecture is true with no restrictions on / for a G [0,a„] such that —)• 0 as 
n —oo. For a G with 0 as n —)• 0, they showed that the conjecture holds 

under the additional assumption that / is unbiased. They also improved the bound of [Erk96] for 
unbiased functions /, showing that in this case 

1 - (1 - 2a)^ 

is bound approaches 1 — h{a) as a ^ 

2 A problematic approach to the conjecture 

It is natural to attempt to strengthen the Courtade-Kumar Conjecture by determining the maxi¬ 
mum value of I{f{x) ; y) among functions of each fixed mean y, = E[/]. For example, one might 
try to prove the equivalent formulation in terms of Ent'*’ by an induction on n (or tensorization), 
as discussed in [Cha04]. Although the maximizing / for the original conjecture presumably occurs 
for /i = 0, an inductive approach would lead to subfunctions of / which wouldn’t necessarily have 
mean 0. 

Indeed, Courtade and Kumar made such a stronger conjecture, discussed in this section. In 
discussing this generalization of the problem, we will find it convenient to switch notation, now 
thinking of / : {±1}” ^ {0,1}. 

Courtade—Kumar Lex Conjecture. Fix n and let (x, y) be p-correlated n-bit strings. Among 
all functions f : {Tl}” ^ {0,1} with a fixed mean E[/] = y, the mutual information I{f{x) ; y) 
is maximized when f is “lex”; i.e., the indicator of the first /i2"' points of {Tl}” in lexicographic 
ordering. 

Remark 2.1. In particular, if y is of the form 2“^ for some integer 0 < k < n, the conjecture is 
that a maximizing / is an indicator of a A:-codimensional subcube; equivalently, a logical A:-AND 
function. 

If true, this Lex Conjecture would essentially resolve the original conjecture. We remark that 
when / is a A:-AND function as in Remark 2.1, it’s not hard to calculate that I{f{x) ; y) has the 
simple form A;2^“^(l — h{a)), making the Lex Conjecture particularly tempting. Unfortunately, 
Chandar and Tchamkerten [CT14] showed that the Lex Conjecture fails. Specifically, they showed 
that for each a there exists /c G N such that /c-AND functions do not maximize I{f{x) ; y) among 
/ : {il}” {0,1} of mean 2~^ (assuming n is sufficiently large). In particular, they showed that 

indicators of (essentially) Hamming spheres do better. 

A subsequent version of the Courtade-Kumar paper [CK14] suggested working around this 
counterexample by revising the Lex Conjecture to assume that h{y) > 1 — h{a)\ i.e., that y is not 
too close to 0 or 1. Unfortunately, we show below that this revision does not help. Indeed, we show 
that once y is close enough to 0 (but still “constant”), the Lex Conjecture becomes false as ^ 0 
(which is equivalent to a —^ and hence 1 — h{a) —>■ 0). 

Failure of the Lex Conjecture as /? —>• 0. To see this, first note that among functions / : 
{±1}"' —>■ {0,1} of fixed mean y, maximizing I{f{x) ; y) is equivalent to minimizing 'E[h{Tpf{x))]. 
Recall the Fourier formula 

Tpf = y + pf=^ + p^f=^ + p^f=^ + • • • , 


Hf{x) ; y) < ^^^(1 - 2af + 9 


for a G 


i 1- 


1 i 1 


Fl ’ 2 


The authors point out that th: 
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where f ^ = X]| 5 |=j fi^) Flies Thinking of p —)• 0, we apply the Taylor expansion to h{Tpf{x)) 
and deduce that it is of the form 

h{n) + co{n)f='^ix) ■ p + (ci(^)/=2(x) + C2{p)r^{xf) ■ + {■■■)■ + ■■■ , 

where the Ci{p,ys are certain constants depending only on p. In particular one may check that 
C 2 {p) = ~ 2 in 2 -/j(i-/j) ^ Thus when we take the expectation over x, we find that minimizing 
E[/i(Tp/)] (for p sufficiently close to 0) becomes equivalent to maximizing W^[/] = E[/^^(a;)^] = 
Z^iLi the Fourier weight at degree 1. 

The question of precisely maximizing the Fourier weight at degree 1 among / : {±1}”' —)• {0,1} 
of mean /r is a well-known, difficult one. However, it is a folklore fact that indicators of Hamming 
balls are superior to logical ANDs (i.e., lex functions) when p is sufficiently small. More precisely, 
suppose we fix /r = 2“^ for some k G N"*". Then from [0’D14, Props. 5.24,5.25,5.27] we have that 
W^[ANDfc] = p^ log(^) but that there are Hamming ball indicators fn '■ {0,1} with 

E[/n] ^ {2\u2)pHog{]^) > 1 . 386^2 

Here U denotes the Gaussian isoperimetric function. If k is large enough that > l.38p^ los(p) 

then by taking n large enough and slightly modifying /„ we can ensure that E[/„] = p exactly while 
still retaining W^[/„] > 1.3;U^log(^) = 1.3W^[ANDfc]. Then for p sufficiently close to 0 (i.e., a 
sufficiently close to we will be able to conclude that I{fn{x) ; y) > /(ANDfc(s) ; y). 


3 The problem in continuous settings 

We have shown that resolving the more general conjecture of maximizing I[f{x) ; y) among f of a 
fixed mean looks to be very difficult in the Boolean setting, since even the problem of maximizing 

[/] among / of fixed mean is unsolved. A difficulty with this problem seems to be the lack of 
effective symmetrization techniques in the discrete setting. To combat this we propose investigating 
the Courtade-Kumar problem in natural continuous settings. 

For isoperimetric problems, Gaussian and spherical analogues have been studied extensively. 
The appearance of the Gaussian isoperimetric function above suggests that the Courtade-Kumar 
problem is related to isoperimetric problems and motivates its investigation in Gaussian space and 
on the sphere. In addition, one can think of the Gaussian setting as a special case of the original 
Boolean problem via the Central Limit Theorem. The study of the Gaussian analogue is further 
motivated by the frequent use of Gaussian random variables in other areas of information theory, 
including, for example, in the context of Gaussian channels [CT91]. 

In both the spherical and Gaussian settings, the Courtade-Kumar problem can be stated as 
“What function maximizes H{f{x)) — H{f{x)\y) when x and y are p-correlated vectors?” for the 
appropriate notion of p-correlated vectors. We consider 0/1-valued functions /, but x and y are 
drawn from a continuous domain. We then define H[f{x)\y) = ^y[H{f{x)\y = y)]. For fixed 
mean p, we want to find / maximizing —H{f{x)\y). 


Gaussian space. In this case we define x and y to be p-correlated n-dimensional standard 
Gaussian random vectors. This means that a; is a standard n-dimensional Gaussian random vector 
and y = px \/l — p^z, where 2 is an independent standard n-dimensional Gaussian random 
vector. Equivalently, the pairs {xi^yfj are independent across 1 < i < n and each is distributed 


as a 2-dimensional mean-zero Gaussian with covariance matrix 




In analogy with Tp, we 
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define the Gaussian noise operator Upf{x) = E[/(y) | x = x\. We can then write —H{f{x)\y) = 

[ ^(^p/(®))] • 

We show that halfspaces are most informative in Gaussian space. 

Theorem 3.1. Let f : R"' —)■ {0,1} and let x and y be p-correlated standard Gaussian random 
vectors with 0 < p < 1. Then —H{f{x)\y) < —H{lp{x)\y), where is the indicator of a halfspace 
y such that [lr;(®)] = E 3 ,^n(o,i)" [/(*)]• 

The theorem shows that for fixed mean p, halfspaces of mean p are optimal. We expect that 
halfspaces with mean 1/2 are optimal overall, but were unable to show this. We observe that 
the statement of this theorem is very similar to a general form of Borell’s Isoperimetric Theorem 
[Bor85]: 

Theorem 3.2. [Bor85] Let f : R" —)• {0,1} and 'h : R>o — )■ R &e increasing and convex. Then 


E [^{Upf{x))] < E 


[^{Uplp{x))l 


where Ip is the indicator function of any halfspace y such that E 3 ,^j..f(o,i)" [lp(®)] = [/(*)]• 

Although it may be possible to deduce Theorem 3.1 from Theorem 3.2, we did not see how to 
do this. Our function —h(x) is convex, but is not increasing. 

Our proof follows from the spherical case below via Poincare’s limit. This is the observation that 
a uniform random point on a high-dimensional sphere projected onto a small number of coordinates 
looks Gaussian. The proof idea is from Beckner [Bec92] with details filled in by Garlen and Loss 
[GL90]. 


The sphere. In this case we define x and y to be p-correlated points on the unit sphere S'^~^ in 
n dimensions. This means that a; is a uniformly random point on the surface of and that y 
is the result of a ln(l//9)-time Brownian motion on started at x. Equivalently, y is defined to 
be the first point on hit by a standard n-dimensional Brownian motion started from px. We 
denote the corresponding noise operator by Pp. Then —H{f{x)\y) = 

Once again, halfspaces are most informative. We write x ~ S'^~^ for x drawn uniformly at 
random from the surface of 5’”'“^. 

Theorem 3.3. Let f : ^ {0,1} and let x and y be p-correlated points on the unit sphere 

S'^~^ with 0 < p < 1. Then —H{f{x)\y) < —H{lp{x)\y), where Ip is the indicator of a halfspace 
y such that Ea...^ 5 n-i [l,j(a;)] = Ea...^ 5 rL-i[/(a;)]. 

Again, we believe that halfspaces with mean 1/2 are optimal, but were not able to show this. 

The halfspace Ip is a symmetrization of the corresponding function /. Rather than directly 
proving that this symmetrization increases the mutual information, we show that a much simpler 
notion of symmetrization called polarization increases the mutual information. The halfspace sym¬ 
metrization can be thought of as the limit of repeated polarization and we use an argument of 
Baernstein and Taylor [BT76] to pass from polarizations to halfspaces. 


Noise operators via kernels Here we mention alternative formulations of Upf and Ppf that 
we will use below. In the Gaussian case, the Mehler kernel Up{x,y) is defined as 


Up{x,y) 


1 

(1 — 



p‘^\\x\\^ + 2p{x,y) +p‘^\\y\\‘^ 

2(1-p2) 
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We can write Up in terms of the Mehler kernel: Upf{x) = 'Ey^-^(^Q^i-^n[Up{x, y)f {y)]. 
We define the Poisson kernel Pp in the spherical case: 

Ppix,y)= ^ ^ 


k - py\Y 


Similarly, Ppf{x) = [Pp(x,y)/(y)]. 


4 The spherical case 


Let S'^ ^ be the sphere of radius R in n dimensions. For x = (xi,X 2 ,... ,Xn) G the polar 

angle 9x is the angle between x and r = {R, 0,..., 0). In other words, xi = Rcos Ox- Let uJn-i,R be 
the uniform probability measure on we will omit the subscripts when they are clear from the 

context. Let C{9) denote the spherical cap {x G 5'^”^ : 9x G [0,0)}. For / : —)• R, we define 

the symmetric decreasing rearrangement of / as 

f{x) = inf{f : w(y : /(y) > t) < u){C{9x))}. 


We will show the following result: 

Theorem 4.1. Let m be a uniform measure on s--\ whieh may or may not he normalized. Let 
'k : R —>• R 6e a convex, uniformly continuous function and let f : [0,1] be integrable. Let 

iL : R —)• R 6e a non-decreasing bounded measurable function. Then 

f f K{{x,y))f{y)dm{y)] dm{x) < [ f K {{x, y)) f (y) dm{y)] dm{x). 

Theorem 3.3 is an immediate corollary. 

Proof of Theorem 3.3. Observe that 

-H{f{x)\y)=P‘[-h{Ppf{x))\= [ -h( [ Pp{x,y)f{y)duj{y)\duj{x). 

Since Pp{x,y) is a non-decreasing function of {x,y) and —h is convex. Theorem 4.1 implies that 
this quantity is upper bounded by 

/ -h{Ppf{x))dL 0 ix) = E[-h{Ppf{x))] = -H{f{x)\y). 

Js"-^ 

It is easy to see that f = Ip for some halfspace y such that E[/] = E[l^]. □ 


Following Baernstein and Taylor [BT76], we prove Theorem 4.1 for a simpler symmetrization 
called a polarization. The symmetric decreasing rearrangement can be thought of as the limit of 
repeated polarizations, so we obtain the desired result. 

We now define the polarization operation. Let a be a hyperplane through the origin that does 
not pass though r. Let be the hemisphere defined by a that contains r and let H~ be the other 
hemisphere defined by a. For x G S'}}, we will denote the reflection of x across cr as crx. Then the 
polarization of / : > R with respect to a is 


nx) 


max{/(x), fiax)} if x e H+ 
min{/(x), f{ax)} if x G iL". 


To simplify notation, define Kf{x) = J^n-i K{{x,y))f{y) dm{y). We will prove the following 
statement: 
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Theorem 4.2. Under the assumptions of Theorem f.l 


{Kf (x)) dm{x) < [ 'i>{K(x)) dm{x). 


for every hyperplane a passing through the origin that does not contain r. 


As in [BT76], proving this result for polarizations implies the corresponding result for the 
symmetric decreasing rearrangement. 


Lemma 4.3. Under the assumptions of Theorem ^.1, if 


[ {Kf (x)) dm{x) < [ 'It{Kf^(x)) dm{x) 


for every hyperplane a passing through the origin that does not contain r, then 


f {K f (x)) dm{x) < f ^ {K f (x)) dm{x). 

Js^-^ Js^-^ 

The proof of this lemma exactly follows an argument from [BT76]; we include the proof in 
Appendix A for completeness. 

We will now prove Theorem 4.2. First, we will need a couple of lemmas about the interaction 
of these reflections with inner products. 

Lemma 4.4. For G 5'"'“^ and any hyperplane a through the origin, {x,y) = {ax, ay). 


Proof, ax = Ux for some unitary matrix U. The lemma follows. 


□ 


Lemma 4.5. If x & Hf:, then {x,y) > {ax,y) for all y G Hf:. Similarly, if x G , then 
{x,y) < {ax,y) for all y G . 

Proof. Let v be the unit vector perpendicular to the hyperplane a such that v G Hfi. Write 
X = UxV + Vx, where v^ is orthogonal to v. Then ax = —axV + v^. For x,y G 77^, ax, Oy >0 and 
we then have that 


{x,y) = axtty + {Vx,Vy) > -OxUy + {Vx ,Vy) = {ax,y). 

The proof of the second statement is similar. □ 

We will also need a lemma about convex functions. 

Lemma 4.6. Let 'h : IR — )• IR 6e convex and consider x,y,x',y' such that x + y = x' + y' and 
\x' — y'\ > \x — y\. Then 'h(x) + 'h(y) < 4'(a:') + 4'(y')- 

Proof. Assume x' ^ y'] the result is obvious otherwise. Without loss of generality, let y' > x' and 
y > X. It is then clear that x' < x < y < y'. 

Now let A = yiZx' ^ [0) !]• Short calculations show that x = \x'+{l—X)y' and y = (1—A)x'+Ay'. 
By convexity. 


4'(x) = 4'(Ax' + (1 - X)y') < A^(a;') + (1 - A)4'(y') 

4'(y) = 4'((1 - A)x' + Ay') < (1 - A)^(a:') + A^(y'). 

Adding these two inequalities completes the proof of the lemma. □ 
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We now come to the two main lemmas of this section. 


Lemma 4.7. Kf{x) + Kf{ax) = Kf^{x) + Kf^{ax). 

Proof. Expanding definitions and using reflections, we can write Kf(x) + Kf{ax) as 

[ K{{x,y))f{y) + K{{x,ay))f{ay) + K{{ax,y))f{y) + K{{ax, ay)) f {ay) dm{y). 

Jh+ 

By Lemma 4.4, this is equal to f^+{K{{x, y)) + K{{ax, y))){f{y) + /(o'y)) dm{y). 

Similarly, 

Kf^{x)+Kf^{ax)= [ {K{{x,y)) + K{{ax,y))){r{y) + r{ay))dm{y). 

Jh+ 

By the definition of /'^, f{y) + f{ay) = /'^(y) + /^{cry), so the two integrands are equal and the 
lemma follows. □ 

Lemma 4.8. \Kf^{x) — Kf^{ax)\ > \Kf{x) — Kf{ax)\. 

Proof. By similar calculations to those in the proof of the previous lemma. 


Kr{x) - Kr{ax) = [ {K{{x,y)) - K{{ax,y))){r{y) - r{ay))dm{y) 

Jh+ 

Kf{x) - Kf{ax) = [ {K{{x,y)) - K{{ax,y))){f{y) - f{ay))dm{y). 

Jh+ 

First, observe that /'^(y) — f^{o'y) = |/(y) — /(cry)! for y G Hf:. Next, note that for fixed x, 
K{{x,y)) — K{{ax,y)) has the same sign for all y G Hf:. Indeed, if x G then K{{x,y)) > 
K{{ax,y)) for all y G Hf: by Lemma 4.5. Likewise, if x G H~, then K{{x,y)) < K{{ax,y)) for all 
y G Hf'. We can therefore write 


\Kf%x) 



\{K{{x,y)) - K{{ax,y))){f{y) - f{ay))\ dm{y) 
{K{{x,y)) - K{{ax,y))){f{y) - f{ay))dm{y) 


Using Lemmas 4.7 and 4.8, we can now complete the proof of the theorem. 


□ 


Proof of Theorem f.l. 



'^{Kf{x)) dm{x) 



'^{Kf{x)) + "^{Kf{ax)) dm{x) 


< / ^{Kf^{x)) + '^{Kf‘^{ax)) dm{x) by Lemmas 4.7, 4.8, and 4.6 

Jh+ 

= f {Kf'^{x)) dm{x). 

Jsi-^ 


□ 




5 The Gaussian case 


In this section, we will use Theorem 4.1 to prove that halfspaces are most informative in Gaussian 
space. Let 7 be the standard Gaussian measure on IR"', which has density ^^ 7^772 exp ( —i||x|p). 

Theorem 5.1. Let T : IR —s- IR 6 e convex, bounded, and uniformly continuous and let f : IR” — s- 
{0,1}. Let [0,1). Then 

[ '^{Upf{x))d-f{x)< [ '^{Uplp{x))d'y{x), 

JR" JR" 

where Ip is the indicator function of some halfspace p such f/iaf E^-gN^o,!)" [/(*)] = Ea.gN(o,i)" [lr;(®)]- 

Taking T = —h, this immediately implies Theorem 3.1. To reduce clutter, we will write drop 
the factor of and write d'y{x) = exp (—^||ic|P) dx for the rest of this section. 


5.1 The proof idea 

First, we give the intuition behind the proof. For u drawn uniformly at random from the 

projection of u onto its first n coordinates is close to being distributed as an re-dimensional Gaussian 
for large N. This well-known fact is sometimes called Poincare’s observation. We can use this idea 
to transfer results for the sphere to Gaussian space as was done in [Bec92, CL90]. 

To make this plan more concrete, observe that we can write u G as 



where x G and v G Given / : IR” —)• IR, we then define to be the extension of / to 

S^~^. More formally, we define — )• IR such that /®^*(re) = f{ui,U 2 , ■ ■ ■ ,Un). The idea 

of the proof is to show the desired inequality involving / on the sphere for f^^^ and then take the 
limit as N increases to derive the corresponding inequality for /. 

We now give a simple example: For bounded / : IR'^ IR, the expectation of f^^^ on 
converges to the expectation of / in Gaussian space. First, we give a formula for integrating over 
the sphere according to the decomposition in (3). Let sn-i,r be the uniform surface measure on 
S^~^. We will suppress the subscripts, as they will be clear from the context. 

Lemma 5.2. Let g : —)• IR. Then 


L 


g{u)ds{u) = 


iB’l 


g{x,v) 


N-n-3 



ds{v) dx. 


This is essentially shown in, e.g., [ABROl]. 

For the rest of this paper, set R = y/N — n — 0. Then observe that 


lim 

N^OO 




dx = d'y{x). 


Together with Lemma 5.2, this implies that 


lim [ f^^^iu) dxjiu) = f f{x)d'y{x). 

N^oo JgN-i 


9 











The proof of Theorem 5.1 is not quite so simple: the use of the noise operator raises technical 
complications. However, Carlen and Loss [CL90] showed how to overcome these difficulties and 
pass from inequalities involving the spherical noise operator to inequalities involving the Gaussian 
noise operator. We largely follow their treatment, introducing a “Poisson-like” kernel Qp such 
that limAT^oo f duj(u) = f 'I'(Upf(x)) dj(x) and then using Theorem 4.1 to show that 

f^(Qpr-\u))duj(u) <f'^(Qpl^^^(u))duj(u). 


5.2 Rewriting a “Poisson-like” kernel in terms of a “Mehler-like” kernel 

Following [CL90], we will construct Qp on x that converges to the Mehler kernel as N 

increases. Thinking of as the product of and as in (3), Qp will factor into UN,p-Pp' 

such that U]si,p '■ x —)■ R converges to the Mehler kernel and Ppi : x ^ R 

is a Poisson kernel that integrates to 1. 

We will now give formal statements of these ideas. The lemmas in this section are essentially 
given in [CL90]; we include proofs in Appendix B. Recall that p G [0,1). First, define Qp : 
S'p X —)• R so that 

I 11^ 

where is the surface area of The “Mehler kernel” factor of this quantity is 


Up,N{y,z) 


(1 - p2)l-n/2 

(1 - r‘^(y,z))A{y,z)^ 


where 


My, z) = - M -+ 


\ 


1 + P^- 


- pH I- 


R 2 


1 - 


R 2 


and 


r{y,z) = 


p 1- 


2 \ 1/2 




1 - 


2 \ 1/2 




My,z) 

The next lemma shows that Qp can be written as a product of Up^^iy, z) and a Poisson kernel. 
Lemma 5.3. Let u = ^1 — G such that y G B^ and w G as in (3). 

Likewise, let v = ^1 — G such that z G B'^ and x G 5’^“"'“^. Then 


Qp{u,v) = Up^N{y,z) 


\SN 


—n—1 1 


\w — rx\ 


iN-n 


and r G [0,1). 

We address the Mehler and Poisson factors in turn. As N goes to oo, Up^N{y,z) converges to 
the Mehler kernel. 


Lemma 5.4. liuiN^ooUp^Niy, z) = Up{y,z). 
The Poisson kernel factor integrates to 1. 
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Lemma 5.5. f„iv-n-i tt-t? — ^ ^ „jv_„ ds(x) = 1 
Define Qpf{u) = fgN-i Qp{u,v)f{v)duj{v) and 


Up,Nf{y) = l||y||<RC/p,Ar(2/, z)f{z) 


dz. 


In the main lemma of this section, we will use the above lemmas to rewrite the spherical quantity 
f ^(Qpf(u)) du{u) in terms of C/p, at. 


Lemma 5.6. 


i^N—n—1 / 2 \ 

R \ / 


R 

ext / 


dy. 


Proof. Lemmas 5.2 and 5.3 imply that Qpf^^^{u) is equal to 
1 f . f f R{l-r^) 


N-n-3 
| 2 \ 2 


5 


'7V-1 

R 




fiz)Up^Niy,z) 


lsN-n-1 |5Af-n-l| ||y, _ pj. 


dsix) 1 - 


\N-n 


i?2 


dz. 


Lemma 5.5 then shows that Qpf^^^{u) = , jv-ii Up^Nf{x). Applying Lemma 5.2 to the outer integral 

Ph I 

completes the proof. □ 

5.3 Passing from the sphere to Gaussian space 

Using the previous section, we now prove our main lemma. It essentially states that the spherical 
quantity f '^{Qpf{u))duj{u) converges to the Gaussian quantity f ^{Upf{y))d'y{y) that we would 
like to bound. 

Lemma 5.7. limAr^oo fg^-i ^ ■ Qp/®^H^)) doj(u) = '^{Upf{y)) d-f{y). 

To prove this lemma, we will need an additional technical lemma given in [CL90]. 

Lemma 5.8. (^1 - < Md^x). 

We give a proof of this lemma in Appendix B. 

Proof of Lemma 5.7. By Lemma 5.6, it suffices to show that 

( N-n-3 

dy= [ T(C7 p/(y))d7 (y). 

-a J jRn 

First, we prove that limjv^oo Up^^fiy) = Upf{y). For each y, z G R”, Lemma 5.4 implies that 

1 , 


N-n-3 
| 2 \ 2 


\im^lllyll<RUp,Niy,z)f{z) 1 - 


= Up{y,z)f{z)exp ( -^ll^ll 
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We then wish to upper bound 


by an integrable function so 


hy\\<RUp,Niy,z)fiz) (^1 - ^ 
we can apply dominated convergence. Lemma 5.8 implies that r < p and, using the definition of 
Up, TV, we see that 


hv\\<RUp,Niy,z)fiz) 1 - 




< 


1 - 




(1 - p^yl^A-^ 


Applying Lemma 5.8 again shows that the right hand side is at most cexp exp for 

some c that does not depend on z or N. For a given y, this is integrable; dominated convergence 
then implies that limTv^oo t4>,Af/(y) = Upf{y). Since is uniformly continuous, we exchange the 
limit and the application of 'h. Since 'L is bounded, we can apply dominated convergence to the 
outer integral to complete the proof. □ 


We can now prove Theorem 5.1. 


Proof of Theorem 5.1. By Theorem 4.1, 



du}{u) < 






du{u). 


Since is 0/1-valued, is the indicator function of a halfspace t]. By symmetry, we assume 
that rj = {u £ IR'^ ■ ui > t} for some t G IR. Then h depends only on the first coordinate of u and 
Ijj = 1“*, where p' is the halfspace {u G IR” : ui >t}. Using Lemma 5.7 to take the limit on both 
sides, we obtain J^r,^{Upf{y)) dp{y) < T([/pl^/(y)) d 7 (y). 

It remains to show that Ea.giv(o7)r> [/(a;)] = E3,gN(o,i)" To see this, observe that 

JgN-i duj{u) = JgN-i l®^*(u) duj{u). The result then follows from (5.1). □ 

R R 
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A From polarizations to the symmetric decreasing rearrangement 

In this section, we give a proof of Lemma 4.3, which was essentially proven by Baernstein and 
Taylor [BT76]. Our setting is very slightly different, but no new techniques are required and the 
proof exactly follows the ontline of [BT76]. 

Lemma 4.3. Let m be a uniform measure on S'^, which may or may not be normalized. Let 
T : IR —)• IR be a convex, uniformly continuous function and let / : —>■ [0,1] be integrable. Let 
iL : IR —)• IR be a non-decreasing bonnded measurable function. If 

f 'L(iL/(x)) dm(x) < f '^{K{x)) dm{x), 

Js^ Js’^ 

for every hyperplane a passing through the origin that does not contain r = (R, 0,..., 0), then 



'it{Kf (x)) dm{x) < 



"^{Kf{x)) dm{x). 


Proof. Eor brevity, define J{f ) = f{x)) dm{x). As described in [BT76], it suffices to 

consider continuous functions /: Eor any / G there a sequence of continuous functions {/j} 

converging to / in the Li norm. Let C{S'fQ be the set of continuous functions on 5)^; C{Sff) is 
complete under the supremum norm. Recall the definition of the modulus of continuity: 


= sup{|/(x) - f{y)\ : \x - y\ < d,x,y £ 


We can then define 


V = {F£ C{Sf,) : UJ{;F) <uj{;f),F = f, and J(/) < J{F)}. 

Observe that V is nonempty: it contains for all hyperplanes a through the origin. The fact that 
the modulus of continuity decreases under polarizations and / = /”' is given in [BT76, Lemma 1]. 
To prove the lemma, it suffices to show that f £ V. Assume for a contradiction that f ^ V. 
Consider 

D{F)= I {F-Jfdm. 

We will derive a contradiction by showing that for any function h that minimizes D on V with 
h ^ f, we can find another function h' such that D{h') < D{h). To do this, we first need to show 
that D attains a minimum value on V using the Extreme Value Theorem. In order to use this 
theorem, we need to show that V is compact and D is continuous. 

Claim A.l. V is compact under the supremum norm. 
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Proof. We first use the Arzela-Ascoli Theorem to show that V is relatively compact and then show 
that the limit of any convergent sequence of functions in V is also V. 

To apply the Arzela-Ascoli Theorem, we need V to be equicontinuous and uniformly bounded. 
Equicontinuity is immediate from the definition ofV. To see that V is uniformly bounded, observe 
that for any F G V, it holds this |F| < sup 2 ,g 5 n{|/(x)|}. This follows from continuity of F and 

F = f. Since / G it is bounded and thus V is uniformly bounded. 

It remains to show that the limit of any convergent sequence of functions in V is also in V. Let 
be a convergent sequence in V and let limj^ooS'i = 9- Since C{Sff} is complete, it suffices 
to show that oj{-,g) < w(-,/), g = f, and J(/) < J{g). It is clear that uj{-,g) < holds. 

To see that g = f, assume for a contradiction that g{x) > /(x); this is without loss of generality. 
Then there exist t G IR and e > 0 such that m{x ; g{x)^ t + e) > m{x : f{x) > t). The right 
hand side is equal to m{x : gi{x) > t) for all i since gi = /. Then for all i, there exists x such that 
g{x) — gi{x) > e. The contradicts convergence of the giS in the supremum norm. 

Lastly, we show that J{f ) < J{g). Note that the gf?, are uniformly bounded. We can then 
apply dominated convergence and use uniform continuity of 'L to deduce that limj^oo J{9i) = J{9)- 
Since J(/) < J{gi), it must be the case that J(/) < J{g). □ 

Claim A.2. D is continuous. 

Proof. Observe that 


\D{F)-D{G)\ 


{F -G){F + G + 2f)dm 


n 

R 


< sup |F(x) — G(x)| [ |F-|-G-|-2/1 dm. 
xes- Js^ 


Since F, G, and / are bounded, fg„ |F -|- G -|- 2/| dm is bounded and \D{F) — D{G)\ goes to 0 as 
the supremum norm sup^-g^n |F(x) — G(x)| goes to 0. □ 

Using these two claims, the Extreme Value Theorem implies that D attains a minimum value 
on V. Let h ^ f he a minimizing function in V. Now we will derive a contradiction by exhibiting 
a function h' in V such that D{h') < D{h). We will set h' = h" for an appropriately chosen 
hyperplane a. 

Claim A.3. There exists a hyperplane a through the origin and a set B C of positive measure 
sueh that 

f{x) > f{crx) and h{ax) > h{x) 


for all X G B. 


Proof. Since h = f but h ^ f, h must not be symmetric decreasing. That is, there must exist some 
t such that E = {x : h{x) > t} is not equal to G = {x : /(x) > t}. We know that / and h are 
continuous and that m{E) = m{G), so both E\G and G\E have positive measure. Let x be density 
point of E\G and y be a density point of G \ F. Let a be the hyperplane through the origin such 
that ax = y. Then f{y) > t > /(x), so r ^ a and y G Hf:. Define B = ri{G\E)ria{E\C). By 
considering a small neighborhood around y and its reflection under a, we see that B has positive 
measure. Then for x G F it holds that /(x) > /(ax) and h{ax) > h{x). □ 


Claim A.4. 



hf dm < 



h^fdm 
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Proof. Lemma 4.5 shows that {x,r) > {ax,r) for all x G Hf:. Since {x,r) = R^cosOx, f is an 
increasing function of {x, r) and so f{x) > /{ax) for x G . By definition, h^{x) > h^{ax) for x G 
For ai, 02 , 6 i, 62 € K- with oi > 02 and 61 > 62 , it is easy to show that 0162 + 02^1 < 0161 + 0262 , 
with sluict inequality if oi > 02 and 61 > 62 . In our c^e, this imphes that h{x)f(x) + h{ax)f{ajp) < 
h'^{x)f{x) + h'^{ax)f{ax) for all x G Hf:\B and h{x)f{x) + h{ax)f{ax) < {x)f{x) + h^{ax)f{ax) 

for all X G B. The claim follows: 


/ h{x)f{x)dm{ 

Js^ 


X) = 


< 


/ h{x)f{x) + h{ax)f{ax)dm{x) 

Jh+ 

f h^{x)f{x) + h"{ax)f{ax) dm{x) 
Jh+ 

[ h^{x)f{x)i 

Jsi 


I dm. 


□ 


Using this claim, we can complete the proof. Note that h and have the same norm. Then 


D{h) 


{h — /)^ dm 



2hf + dm > 



2h^ f + dm 



ffdm = D{h'), 


which is a contradiction. 


□ 


B Proofs omitted from Section 5 

The proofs in this section follow those of Carlen and Loss [CL90]. Recall the following definitions: 

R(1 -p2)i-V2 


Qp{u,v) = 

Up,N{y,z) = 


n 1 | ||.^ _ "■ 

(1 - p2)l-n/2 

(1 - r‘^{y,z))A{y,z)^ 


where 


A/ \ 1 , 

My, z) = - M -+ 




1 + z) 


- pM I- 


R2 


1 - 


R2 


and 


r{y,z) = 


p 1- 


2x1/2 


1+ 


1 - 


2x1/2 


w 


My,z) 

As above, we set R = a/N — n — 3 and define to be the surface area of 5'^“^. 


B.l Proof of Lemma 5.3 


Lemma 5.3. Let u = I y, I 1 — 


2x1/2 


2 ^ G ^ such that y G and w G ^ ^ as in (3). 
Likewise, let v = G such that 

R(l-r2) 


z G and x G Then 


Qpiu,v) = Up^N{y,z) 


\S^ 


-n-l\ 


rc — rx 


\N-n 


and r G [0,1). 
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The proof is outlined in [CL90] . 

Proof. We want to find A{x,y) and r{x,y) such that 


|n — pv\\‘^ = A\\w — rx\\‘^ . 


Since \\w\\ = ||x|| = R, the left hand side is 


\u-pv\\‘^ = lly-pzf + 


1/2 


1/2 


1 - 




W — \ 1 — 


= R^{l + p^ - ) - 2p ( 1 - 




i ?2 

2 


px 


R‘^ 



1/2 


1 - 


R^ 


{w, x). 


The right hand side is 
Setting 

we get that 

Setting 


A lliu — rx||^ = AR?{1 + r^) — 2Ar{w, x). 


2Ar = 2p\l- 


r = 


p 1- 



1/2 


A 


AR^(l + r^) = R^ 1^1 +p^ - ^{y,z) 
and substituting in the above value for s, we get the equation 

l2 / 1 , „2 ‘^P /I , „2 ^ 


A -[l + p -^{y,z)\A + p ( 1 “ 


1 - 


i 22 


= 0 . 


Solving, we obtain 


A = 


^ ~ z) 


wl 


/1 + p 2 — ^{y, z) 


- p 2 1 _ 


R‘^ 


1 - 


i ?2 


So we have that 


Qp{u,v) = 


R{1 - p 2 )i-V 2 

I^AT-n-lj 11^ _ 

R{i - p^y-^/^ 
\SN-n-l\A^ llw - 

(1-/)2)1-^/2 ^ ^ R(l-r^) 


(1 — J \\S^ ^llltc — rx||^ ^ J 

TT f ^ i?(l-r 2 ) 

= RoN(V,z) - r :—■ 

The fact that r G [0,1) follows from Lemma 5.8, which prove below in Appendix B.4. 


□ 
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B.2 Proof of Lemma 5.4 
Lemma 5.4. \\m.M^oo Up^N{y,z) = Up{y,z). 

This lemma is stated without proof in [CL90]. We give a proof for completeness. 
Proof. First, note that 


1 - A / ^ 1 + 

hm A{y, z) = + 

N^oo 2 


1 + p^y i + p^ ^ i-p- 


so 


lim r{y, z) = p. 
N—^OO 


Therefore, it suffices to show that 


N^OO 

An easy calculation shows that 
2 / 


' N-n f ydlyW'^ + \\z\\'^) — 2p{y, z)) 

hm A{y,z)— = exp ... 


1 +/5^ - 


1 - ^ 1 - 


M R2 


2 ( 1 -p 2 ) 


' i-p^ I pH\\yf + \\zf) - p(i + P^){y,z) + 0(1)' 


i22(l _p2) 


Plugging this in to the dehnition of A, we get 

pH\\yf + \\zf)-2p{y,z) + o{l) 


A{y,z) = 1 + 


i? 2 (l_p 2 ) 


Since /2 + o{R^)^ 


- -V-" / — (/o^dlyll^ + llzll^) — 2/9(?/,2;)) 

hm A{y,z)— = exp .. . 


N^oo 


2 ( 1 -p 2 ) 


as desired. 


□ 


B.3 Proof of Lemma 5.5 

Lemma 5.5. = 1- 

To prove this, we need the following corollary of the Poisson Integral Formula (e.g., Theorem 3.43 
of [MPIO]). 


Corollary B.l. For 0 < r < 1, 


1 — 


\N 


duj{v) = 1. 


||m — rv\ 

Proof of Lemma 5.5. Using Corollary B.l, a simple change of variables shows that 


lire — rx 


R 


□ 
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B.4 Proof of Lemma 5.8 


Lemma 5.8. (1 — 


2 \ 1/2 




1 - 


2 \ 1/2 




< A{y^z). 

The proof is given in [CL90]. We include it for completeness. 

Proof. Assume that ||y|| < R and ||z|| < R. Otherwise, the claim is trivial. Define A! as follows: 


. 4 '= 1 ±Tz 


2 2p 


+ 


\ 


'l + p 2 - 


w 


1 _ 


R^ 


1 - 




By Cauchy-Schwarz, we know that A! < A, so it suffices to show that 


1/2 


1/2 


1 - 


R? 


1 - 




< A'. 


Now define a = (1 — 


2x1/2 




,/?= !- 


'W 


2x1/2 


, and let B = 


l+p^—2p\/l—a^\/l— 


2ap 


Then 






2a;S 


a/3 


a/3 ’ 


so we will show that 


1 <B + yjB'^ - p2 


This statement, in turn, is implied by 


1 + 


< B. 


To prove this, observe that for any a, 13, 

(l-a2)(l-/32) < (l-af3f 


and for any p, 

Then 

Rearranging, we see that 


2p < 1 + 


2p \/1 - a2x/l - /32 < (1 + p^){l - a/3). 


1 + ^ 1 + p2 _ 2pVl - - / 3 ‘^ 

2 - ^ 


□ 
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